Balanced SET levels favor the correct enhancer repertoire during cell fate acquisition

Within the chromatin, distal elements interact with promoters to regulate specific transcriptional programs. Histone acetylation, interfering with the net charges of the nucleosomes, is a key player in this regulation. Here, we report that the oncoprotein SET is a critical determinant for the levels of histone acetylation within enhancers. We disclose that a condition in which SET is accumulated, the severe Schinzel-Giedion Syndrome (SGS), is characterized by a failure in the usage of the distal regulatory regions typically employed during fate commitment. This is accompanied by the usage of alternative enhancers leading to a massive rewiring of the distal control of the gene transcription. This represents a (mal)adaptive mechanism that, on one side, allows to achieve a certain degree of differentiation, while on the other affects the fine and corrected maturation of the cells. Thus, we propose the differential in cis-regulation as a contributing factor to the pathological basis of SGS and possibly other the SET-related disorders in humans.

For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section.

n/a Confirmed
The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly The statistical test(s) used AND whether they are one-or two-sided Only common tests should be described solely by name; describe more complex techniques in the Methods section.
A description of all covariates tested A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors and reviewers. We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Portfolio guidelines for submitting code & software for further information.

Data
Policy information about availability of data All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: -Accession codes, unique identifiers, or web links for publicly available datasets -A description of any restrictions on data availability -For clinical datasets or third party data, please ensure that the statement adheres to our policy The ATAC-seq, ChIP-seq, Hi-C, RNA-seq and scMultiome data generated in this study have been deposited in the Gene Expression Omnibus (GEO) database in the are available at GSE212252. NPCs RNA-seq data are available at GSE171266. Control PSC ATAC-seq were obtained from GSE108248 https://docs.github.com/en/ repositories/archiving-a-github-repository/referencing-and-citing-content. NPCs and NPC-derived neurons Hi-C control dataset54 used for this publication were obtained from NIMH Repository & Genomics Resource, a centralized national biorepository for genetic studies of psychiatric disorders. The raw number associated with bar plots pertaining the associated figures, ATAC-seq peaks, ChIP-seq peaks, differentially expressed genes, RNA normalized counts, single cells matrices, functional enrichment results generated in the study are available in the source data and supplementary information files.

Human research participants
Policy information about studies involving human research participants and Sex and Gender in Research.

Recruitment
Describe how participants were recruited. Outline any potential self-selection bias or other biases that may be present and how these are likely to impact results.

Ethics oversight
Identify the organization(s) that approved the study protocol.
Note that full information on the approval of the study protocol must also be provided in the manuscript.

Field-specific reporting
Please select the one below that is the best fit for your research. If you are not sure, read the appropriate sections before making your selection.

Life sciences Behavioural & social sciences Ecological, evolutionary & environmental sciences
For a reference copy of the document with all sections, see nature.com/documents/nr-reporting-summary-flat.pdf

Life sciences study design
All studies must disclose on these points even when the disclosure is negative.

Sample size
No statistical methods were used to pre-determine sample sizes, but our sample sizes are similar to those reported in previous publications (see De

Replication
We repeated all experiments using at least three biological replicates over distinct independent experiments. We specified the number of biological replicates and independent experiments in the respective figure legends.
Randomization We plated the cells in a random distribution onto cell culture and multi-well plate positions, and randomly assigned them to experimental groups. We perfomed cell counting on random miscroscope view fields. Animals were randomly selected for experimental analysis from different litter with appropriate genotype. Covariates like sex were not relevant for this kind of experiments Blinding Data collection and analyses were not performed blind to the conditions due to obvious differences between groups. The same results have been repeated by multiple members of the research team.
Reporting for specific materials, systems and methods We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response.  Mice were maintained were maintained at the San Raffaele Scientific Institute institutional facility in a pathogen-free environment. Temperature and air flow were controlled and constant (T = 22°+/-2; RH = 55% +/-5). Light cycle of 12h phase was used (not inverted). Age used: animals of the pure lines and double transgenic were used for crossing in adult ages (2-6 months of ages). Experimental animals were analyzed at: a) embryonic stage Embryonic day 9.5 (E.9.5) and E10.5 (constitutive mutants) with investigation also at E12.5, E14.5 and P0 (with no mutants found); b) E14.5, P2 and P30 the brain specific mutants. Zebrafish (Danio rerio) embryos were obtained from natural matings of the Wild-type strain AB and raised in E3 medium at 28.5°C on a 14/10-hour light/dark cycle. All experimental procedures were carried out at the San Raffaele Scientific Institute Institutional facility and performed in accordance with experimental protocols approved by local Institutional Animal Care and Use Committees (IACUC).

Wild animals
No wild animals were used in this study

Reporting on sex
No sex based analysis were performed. Note that full information on the approval of the study protocol must also be provided in the manuscript.

ChIP-seq Data deposition
Confirm that both raw and final processed data have been deposited in a public database such as GEO.
Confirm that you have deposited or provided access to graph files (e.g. BED files) for the called peaks.